Over the last decade, evaluation of education programs has become big business, and this has had an impact on quality. When the first legislative mandate for evaluation was written into law as part of the 1965 Title I (ESEA) legislation, evaluation was considered to be an activity carried out at the local level for accountability and to improve the program. Every year thereafter, local evaluation activities were initiated for a. number of programs, usually coordinated by an evaluation specialist within the federal program office. As the number of activities grew, concern with quality and need for generally applicable procedures led to the establishment in fiscal 1970 of a central evaluation unit in OE (see Appendix A).
Funding
Before fiscal 1970, the Office of Education had about $1.25 million per year for central evaluation available. In that year, for the first time, there was a separate line item for evaluation.  The peak funding for the central evaluation unit was reached in 1978, with $29.7 million obligated for evaluation contracts.  In 1980, the amount had decreased to $19.4 million. The most precipitous drop within the unit came in evaluation funds for discretionary purposes, i.e., not earmarked for a specific title:  these funds dropped from $7.1 million in 1977 and 1978 to $3 million in 1980 (U.S. Department of Health, Education, and Welfare 1979b).
According to Reisner's estimate (Appendix A), in fiscal 1980 the Department of Education was planning to spend some $40 million on a variety of evaluation activities, half of the work being carried out by the central evaluation unit and nearly a quarter by the Inspector General.  If one wishes to calculate the total amount spent for program evaluation in education, that estimate needs to be augmented by the amount spent by the General Accounting Office (estimated at $2.5 million) and an unknown amount of federal funds devoted to evaluation it pits the insider's (e.g., clier 's, teacher's, program manager's) intuitive perception against the outsider's concern with quantitative assessment. The result is that they are unlikely t agree on goals. The focus of evaluation may then s if t
